A Visual Method for High-Dimensional Data Cluster Exploration

نویسندگان

  • Ke-Bing Zhang
  • Mao Lin Huang
  • Mehmet A. Orgun
  • Quang Vinh Nguyen
چکیده

Visualization is helpful for clustering high dimensional data. The goals of visualization in data mining are exploration, confirmation and presentation. However, the most of visual techniques serviced for cluster analysis are focused on cluster presentation rather than cluster exploration. Several techniques are proposed to explore cluster information by visualization, but most of them heavily depend on the individual user’s experience. Inevitably, it incurs subjectivity and randomness in the clustering process. In this paper, we employ the statistical features of datasets as predictions to estimate the number of clusters by a visual technique, HOV. This approach avoids the randomness and subjectivity of the user during the process of cluster exploration by other visual techniques. As a result, it provides an effective visual method for cluster exploration.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High-Dimensional Unsupervised Active Learning Method

In this work, a hierarchical ensemble of projected clustering algorithm for high-dimensional data is proposed. The basic concept of the algorithm is based on the active learning method (ALM) which is a fuzzy learning scheme, inspired by some behavioral features of human brain functionality. High-dimensional unsupervised active learning method (HUALM) is a clustering algorithm which blurs the da...

متن کامل

A Prediction-Based Visual Approach for Cluster Exploration and Cluster Validation by HOV3

Predictive knowledge discovery is an important knowledge acquisition method. It is also used in the clustering process of data mining. Visualization is very helpful for high dimensional data analysis, but not precise and this limits its usability in quantitative cluster analysis. In this paper, we adopt a visual technique called HOV to explore and verify clustering results with quantified measu...

متن کامل

Visual Interactive Neighborhood Mining on High Dimensional Data

Cluster analysis is widely used for explorative data analysis, however, it is not trivial to select the right method and optimal parameters. Moreover, not all clustering methods can work with raw or dirty data. In this paper, we introduce an interactive data exploration tool, VINeM, which combines interactive mining with unsupervised tools by exploiting an intuitive neighborhood-based visualiza...

متن کامل

Coordinating computational and visual approaches for interactive feature selection and multivariate clustering

Received: KK Revised: KK Accepted: KK Abstract Unknown (and unexpected) multivariate patterns lurking in high-dimensional datasets are often very hard to find. This paper describes a human-centered exploration environment, which incorporates a coordinated suite of computational and visualization methods to explore high-dimensional data for uncovering patterns in multivariate spaces. Specificall...

متن کامل

Visual Hierarchical Dimension Reduction for Exploration of High Dimensional Datasets

Traditional visualization techniques for multidimensional data sets, such as parallel coordinates, glyphs, and scatterplot matrices, do not scale well to high numbers of dimension. A common approach to solving this problem is dimensionality reduction. Existing dimensionality reduction techniques usually generate lower dimensional spaces that have little intuitive meaning to users and allow litt...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009